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CO ' ABSTRACT 

\ The Fisher matrix approach (jFisher I (|l935t )) allows one to calculate in advance how 

well a given experiment will be able to estimate model parameters, and has been an 
invaluable tool in experimental design. In the same spirit, we present here a method 
to predict how well a given experiment can distinguish between different models, re- 
gardless of their parameters. From a Bayesian viewpoint, this involves computation of 
the Bayesian evidence. In this paper, we generalise the Fisher matrix approach from 
. the context of parameter fitting to that of model testing, and show how the expected 

' evidence can be computed under the same simplifying assumption of a gaussian like- 

£ — . \ lihood as the Fisher matrix approach for parameter estimation. With this 'Laplace 

approximation' all that is needed to compute the expected evidence is the Fisher ma- 
trix itself. We illustrate the method with a study of how well upcoming and planned 
experiments should perform at distinguishing between Dark Energy models and mod- 
ified gravity theories. In particular we consider the combination of 3D weak lensing, 
for which planned and proposed wide-field multi-band imaging surveys will provide 
suitable data, and probes of the expansion history of the Universe, such as proposed 
supernova and baryonic acoustic oscillations surveys. We find that proposed large- 
scale weak lensing surveys from space should be able readily to distinguish General 
Relativity from modified gravity models. 

■ Key words: methods: statistical, cosmological parameters, dark energy 
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1 INTRODUCTION 

While the goal of parameter estimation is to determine the 
best fit values (and the errors) of a set of parameters within a 
model, model selection seeks to distinguish between different 
models, which in general will have different sets of param- 
eters. Model selection has received at tention in co smology 
only relatively recently (starting with Ijaffel l|l996l )). Only 
in recent years have cosmological data had enough statis- 
tical power to address the problem, although elsewhere in 
astr onomy model selec t ion h as been applied for some time 
(e.g. iLucv fc Sweeney I l|l97ll) ). 

For param eter estimation, the Fisher matrix approach 
(|Fisher I (| 19351 )) has been invaluable in experimental design. 
It allows one to forecast how well a given experiment will 
be able to estimate model parameters. In the same spirit, 
one may want to predict how well a given experiment can 
distinguish between different models. 

Of particular interest is the case of nested models, where 
the more complicated model has additional parameters, in 
addition to those in the simpler model. The simpler model 



may be interpreted as a particular case for the more com- 
plex model where the additional parameters are kept fixed 
at some fiducial values. The additional parameters may be 
an indication of new physics, thus the question one may ask 
is: "would the experiment provide data with enough statis- 
tical power to require additional parameters and therefore 
to signal the presence of new physics if the new physics is 
actually the true underlying model?" 

Examples of this type of questions are: "does the pri- 
mordial power spectrum deviate from scale invariance?" , 
"do the data require a running of the primordial power 
spectrum spectral index?", "do observations require a 
dark energy that deviates from a cosmological constant 
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In this paper we present a general statistical method for 
rapidly calculating in advance how well a given experimental 
design can be expected to distinguish between competing 
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(and in particular nested) theoretical models for the data, 
under the Laplace approximation. 

We then present an application of the method to the 
cosmological context: we forecast how three proposed weak 
lensi ng experiments may dis ti nguish between D ark Energy 
(e.g. IPeebles fc Ratra I (|l988l):IWetterlich I dl988l) and modi- 
fied gravity models (e.g. iDvali. Gabadaze fc Porratil (|2000l ) 
(DGP)). " 

The standard cosmological model is extremely success- 
ful: with only 6 parameters it fits a host of observations 
and provides a description of the Universe from z=1100 to 
present day. This finding is in good agreement with what can 
be expec ted of current data, based on Baye sian complexity 
theory f (|Kunz. Trotta fc Parkinson I (|2006l )). The parame- 
ters of the model are tightly constrained, many at the per- 
cent level, and there is a considerable weight of evidence 
in favour of a substantial contribution of Dark Energy. Al- 
ternative explanations for the apparent Dark Energy are 
a Cosmological Constant, a slowly rolling scalar field or a 
fluid component, whose effects on the expansion history can 
be described by an equation of state parameter which may 
evolve in time, or a large-scale modification to General Rel- 
ativity. 

Each of these can be described as a different model. 
An interesting question is which of these possibilities 
is favoured by the data. In contrast to parameter es- 
timation, this is an issue of model selection, which has 
been the subject of rec ent atten tio n in cosmo log y (e 
Hobson. Bridle fc LahaTl (|2002| ): ISaini et all (f20cT 
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references the rein). In particular, Mukheriee et al. (2006); 



ITrotta I {2006) compare evolving dark energy models with 
a cosmological constant. 

Dark Energy has potentially measurable effects through 
its effect on both the expansion history of the Universe, 
and through the growth rate of perturbations. Some meth- 
ods, such as study o f the lumino s ity d istance of Type la 
supernovae (SN; e.g. iRiess et al. I (Il998l)). baryoni c acous- 
tic oscillations (BAO; e.g. lEisenstein fc Hu I ll 19981 ) ) or g e- 
ometric weak lensing methods (e.g. Taylor et al. I (|2007 



probe only the expansion history, whereas others such 
as 3D cosmic shear weak lensing or cluster counts can 
probe both. Combinations of various probes promise very 
accurate determinations of the e quation of state (e.g. 
iHeavens. Kitching fc Taylor I <|2006l )). 

For methods based on probing the expansion history 
alone, there is a difficulty in that the same expansion his- 
tory of a universe where the law of gravity has been modi- 
fied on large scales can also be obtained in a universe with 
standard General Relativity but a dark energy component 
with a suitable equation of state parameter w(z). In general 
however the growth history of c osmological structu r es wi ll 
be different in the two cases fee. iHuterer fc Linder I ^2006), 
but see lKunz fc Sap one I (|2006| )). 

Therefore in principle there are advantages in using 
methods which also probe the growth rate, such as 3D 
weak lensing. It becomes a very interesting question to ask 
whether such methods could distinguish between the Dark 
Energy and modified gravity scenarios. 

This question may be answered in a Bayesian context 



by considering the Bayesian Factor, which is the ratio of 
the Bayesian Evidences, i.e. the ratio of probabilities of the 
data given the two models. The evidence ratio may be gen- 
eralised to a genuine posterior probability of the models by 
multiplying by a ratio of priors of the models. The evidence 
involves an integration of the likelihood, multiplied by pri- 
ors, over the parameter space of each model, and this can 
be computationally expensive if the dimension of the param- 
eter space is large. By making a simplifying assumption in 
the spirit of Fisher's analysis, one can compute the expected 
evidence for a given experiment, in advance of taking any 
data, and forecast the extent to which an experiment may 
be able to distinguish between different models. The Fisher 
matrix approach in parameter estimation assumes that the 
expected behaviour of the likelihood L near the maximum 
characterises the likelihood sufficiently well to be used to 
estimate errors on the parameters (i.e. the Taylor expansion 
of In L to include second-order terms holds at least until In L 
drops by ~ 1). The Fisher Matrix is defined by 



F af3 



d 2 InL 



(1) 



where a are the model parameters, and (InL) ~ (InL ) — 
F a /3 \_9 a — #°«] [00 — /2, where indicates peak values. 
In this paper, we compute the expected evidence by as- 
suming that we can ignore higher-order terms in the Taylor 
expansion throughout. This allows a major simplification, 
called the Laplace approximation, in that the expected ev- 
idence can be computed directly from the Fisher Matrix 
at essentially no extra cost. Of course, one must make the 
caveat that the quadratic expansion of InL may be a poor 
approximation in some cases, but nevertheless the compu- 
tation of the expected evidence may be a useful first step. 

The layout of this paper is that we present the formal 
computation of the ratio of expected evidences (also called 
the Bayes factor) in Section 2, and apply the method to 
a combination planned and proposed data sets (3D weak 
lensing, microwave background measurements, supernovae 
and baryonic acoustic oscillations probes) in Section 3. 



2 FORECASTING EVIDENCE 

The aim here is to compute the Bayesian Evidence ratio for 
two different models, i.e. given a dataset arising from a true 
model, we want to know the probability with which a second 
model can be ruled out. 

The spirit of this is very similar to the Fisher Matrix 
approach for parameter estimation, where one computes the 
expected likelihood as a function of parameters, in the ab- 
sence of data. 

We make the approximation that the expected likeli- 
hood is everywhere accurately described by a multivariate 
Gaussian, with a curvature given by a Taylor expansion at 
the expected peak. Thus it should be regarded as a first 
step; more sophisticated simulation techniques may be nec- 
essary if this assumption is not adequa t e. A s pecial case of 
this was recently presented bv iTrottal (|2007h . However, it 
is worth pointing out that it is routine to compute expected 
marginal errors on parameters using the inverse of the Fisher 
matrix. This is equivalent to assuming the Laplace approx- 
imation, and doing only one fewer integration than we pro- 
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pose her e. In some cases, the approx i matio n works very well 
(see e.g. ICornish N.J.. Crowder J~l i|2005l )), but there are 
certainly some exampl es where the approxi mation is not par- 
ticularly accurate fe.g. IWang et al.T(|2004 )V For the Planck 
microwave background experiment we find reasonable agree- 
ment between Fisher and Monte Carlo Markov Chain errors 
(to within about 30%), if the same likelihood calculat ions 
are us ed for both. Note that for this paper, we follow iHu I 
(2002) for the likelihood, which is accurate for the noise- 
dominated regime. For Planck, the errors will therefore be 
rather conservative. 

The other approximations we make are that the priors 
on the parameters are uniform, but this could be relaxed to 
Gaussians if desired. We also assume that the priors on the 
two models are the same; this could easily be relaxed as it 
just adjusts the normalisation. 



2.1 Models and Notation 

We denote two competing models by M and M'. We assume 
that M' is a simpler model, which has fewer (n' < n) param- 
eters in it. We further assume that it is nested in Model M' , 
i.e. the n' parameters of model M' are common to M, which 
has p = n — n' extra parameters in it. These parameters are 
fixed to fiducial values in M'. 

We denote by D the data vector, and by 8 and 8' the 
parameter vectors (of length n and n'). 

The posterior probability of each model comes from 
Bayes' theorem: 



p(M\D) 



p(D\M)p(M) 

pW) 



(2) 



and similarly for M' . By marginalisation p(D\M), known as 
the Evidence, is 



p(D\M)= d0p(D\0,M)p(0\M), 



(3) 



which should be interpreted as a multidimensional integra- 
tion. Hence the posterior relative probabilities of the two 
models, regardless of what their parameters are, is 



(4) 



p(M'\D) _ p(M') f d0' P (D\0',M')p{9'\M') 
p{M\D) ~ p(M) f d6p(D\8,M)p(8\M) 

With non-committal priors on the models, p(M') — p(M), 
this ratio simplifies to the ratio of evidences, called the Bayes 
Factor, 



B 



f d0' p(D\8',M')p(8'\M') 
f d8p(D\8,M)p(6\M) 



(•>) 



Note that the more complicated model M will inevitably 
lead to a higher likelihood (or at least as high), but the 
evidence will favour the simpler model if the fit is nearly as 
good, through the smaller prior volume. 

We assume uniform (and hence separable) priors in 
each parameter, over ranges A8 (or AO'). Hence p(8\M) = 
(A0i . . . Adn)- 1 and 



B 



f d8'p(D\8',M') A0i...A0„ 



(6) 



fd8p(D\8,M) A8[...A8' n / 
Note that if the prior ranges are not large enough to contain 



essentially all the likelihood, then the position of the bound- 
aries would influence the Bayes factor. In what follows, we 
will assume the prior range is large enough to encompass all 
the likelihood. 

In the nested case, the ratio of prior hypervolumes sim- 
plifies to 

ak'.'.'.ao" = A9n ' +1 ■ ■ ■ A9n ' +P ' (7) 

1 n' 

where p = n — n' is the number of extra parameters in the 
more complicated model. 

The Bayes factor in equation (((J still depends on the 
specific dataset D. For future experiments, we do not yet 
have the data, so we compute the expectation value of the 
Bayes factor, given the statistical properties of D. The ex- 
pectation is computed over the distribution of D for the 
correct model (assumed here to be M). To do this, we make 
two further approximations: first we note that B is a ra- 
tio, and we approximate (B) by the ratio of the expected 
values, rather than the expectation value of the ratio. This 
should be a good approximation if the evidences are sharply 
peaked. 

We also make the Laplace approximation, that the ex- 
pected likelihoods are given by multivariate Gaussians. For 
example, 

\{d-d ) a F afj {d~d°)p] , 



(p(D\9,M))=L exp 



(8) 



which is centred on 8°, the correct parameters in M. 

A similar expression is assumed for {p(D\8' , M')). The 
Laplace approximation assumes that a Taylor expansion of 
the likelihood around the peak value to second order can 
be extended throughout the parameter space. F a g is the 
Fis her matrix, given for Gaussian-distr ibuted data by (see 
e.g. iTegmark. Taylor fc Heavens I (|l997l )) 



= i T r 
2 



c^c^c^c,? + c- l {p,„ii* ta + ti, a iM* p )] . (9) 



C is the covariance matrix of the data, and fj, its mean 
(no noise). Commas indicate partial derivatives w.r.t. the 
parameters. For the correct model M, the peak of the ex- 
pected likelihood is located at the true parameters 8°. Note, 
however, that for the incorrect model M', the peak of the 
expected likelihood is not in general at the true parameters 
(see Fig.[T]for an illustration of this). This arises because the 
likelihood in the numerator of equation ([6| is the probability 
of the dataset D given incorrect model assumptions. 

If we assume that the posterior probability densities are 
small at the boundaries of the prior volume, then we can 
extend the integrations to infinity, and the integration over 
the multivariate Gaussians can be easily done. This gives, 
for M, (27r) n/2 (detF)- 1/2 , so for nested models, 



(B) = (27T) 



-p/2 



VdetF L' 



A8 n ,, 



(10) 



VdetF 7 L 

An equivalent expression was obtained, us- 
ing again the Laplace a pprox imation by 
lLazarides. Ruiz de Austri fc Trotta I (|2004h . The point 
here is that with the Laplace approximation, one can com- 
pute the Lq/Lo ratio from the Fisher matrix. To compute 
this ratio of likelihoods, we need to take into account the 
fact that, if the true underlying model is M, in M' (the 
incorrect model), the maximum of the expected likelihood 
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Figure 1. Illustrating how assumption of a wrong parameter 
value can influence the best-fitting value of other model param- 
eters. Ellipses represent iso-likclihood surfaces, and here in the 
simpler model, the parameter on the horizontal axis is assumed 
to take the value given by the vertical line. Filled circles show the 
true parameters in the more complicated model, and the best-fit 
parameters in the simpler model. 



will not in general be at the correct values of the parameters 
(see Fig. [TJ . The n' parameters shift from their true values 
to compensate for the fact that, effectively, the p additional 
parameters are being kept fixed at incorrect fiducial values. 
If in M' , the additional p parameters are assumed to be 
fixed at fiducial values which differ by Sip a from their true 
values, the others are shifted on average by an amount 
which is readily computed under the assu mption of the 
multiv ariate Gaussian likelihood (see e.g. iTavlor et al. I 
il2007l) 1: 

6# n = -(F , - l } aB G B eWt a, (3= l...n',( = l...p (11) 



where 

G K = ^Tr [C- X C^C- X C A + CTVcA^ + > ( 12 ) 

which we recognise as a subset of the Fisher matrix. For 
clarity, we have given the additional parameters the symbol 
; f = 1 . . . p to distinguish them from the parameters in 
M'. 

With these offsets in the maximum likelihood parame- 
ters in model M' , the ratio of likelihoods is given by 



L' = L exp (—-86 a F a p56p 



(13) 



where the offsets are given by 88 a = 88' a for a ^ n! (equation 
Hip , and 58 a = 5il> a _ n i for a > n' . 

The final expression for the expected Bayes factor is 

then 



(B) = (2^ 2 -^g£exp {-\56 a F a0 56 ) (^,+,.(14) 



9 = 1 



Note that F and F~ are nxn matrices, F 1 is n xn' , and G 
is ann'xp block of the full nxn Fisher matrix F. The ex- 
pression we find is a specific example of the Savage-Dickey 



ratio (|Dickev I (|l97lh); here we explicitly use the Laplace 
approximation to compute the offsets in the parameter esti- 
mates which accompany the wrong choice of mo del. 

Note that the 'Occam's razor' term, see ISaini et al.l 

|2004h for example, common in evidence calculations, is en- 



capsulated in the (27r) 



factor multiplied by the 



Vdet F' 

prior product: models with more parameters are penalised in 
favour of simpler models, unless the data demand otherwise. 
Su ch terms should be treat ed with caution, as pointed out 
by iLinder fc Miguel! l|2007h simpler models do not always 
result in the most physically realistic conclusions (but see 
iLiddle et al. I ([2006) for a thorough discussion of the issues). 
In cases where the Laplace approximation is not a good 
one, other tec hniques must be used , at more compu t ationa l 



expense ( e.g. iTrottal ([200 



2007T); iBeltran et al. I (12005 
Mukhcrice, Parkinson & Liddlc (2006) 
Parkinson. Mukheriee fc Liddlei (120061 ); iMukheriee et al. 



Skilline (2004); 



12006)). Alternatively, a reparametrization of the parame- 
ter space can ma k e the likelihood closer to gaussian (see e.g. 
iKosowskv et al. I (12002ft for CMB) . 

According to iJeffrevs J |l96lj), 1 < \nB < 2.5 is de- 
scribed as 'substantial' evidence in favour of a model, 2.5 < 
ln_B < 5 is 'strong', and ln£> > 5 is 'decisive'. These de- 
scriptions seem too aggressive: In B = 1 corresponds to a 
posterior probability for t he less-favoured model which is 
0.37 of the favoured model |Kass fc Rafterv I l|l995l ) ). Other 
autho rs have introduced different terminology (e.g. iTrotta I 

(Hq3)). 



3 APPLICATION: DARK ENERGY OR 
MODIFIED GRAVITY 

To apply these results to cosmological probes of Dark 
Energy /modified gravity, we use the convenient Minimal 
Modif ied Gravity parametri zation introduce d by Linder I 
j|2005h and expa n ded by ILinder fc Cahn I (120071 ) and 
iHuterer fc Linderl (|2006l ). where the beyond-Einstein per- 
turbations are described by a growth factor 7. The growth 
rate of perturbations in the matter density p m> 8 = Sp m /pm, 
is accurately parametrised as a function of scale factor a(t) 
by 



exp 



{jT^lo-M'-i]} 



(15) 



where Q m (a) is the density parameter of the matter. The 
growth factor 7 ~ 0.55 for the standard General Relativis- 
ts cosmological model, whereas for modified gravity theo- 
ries it deviates fro m this value. For example, for the DGP 
bran eworld model dDvali. Gabadaze fc Porrati I ([2000)), 7 ~ 
0.68 (|Linder fc Cahn I (|2007f )1. on scales much smaller than 
those where cosmological acceleration is apparent. For this 
paper, we introduce 7 as an additional parameter in Model 
M - i.e. M represents extensions beyond General Relativ- 
ity, whe reas M' represents Gener al Relativity with Dark 
Energy. ISong. Hu. Sawicki I l|2007h show that the simplest 
flat DGP model demonst r ates a difficulty pointed out by 
llshak. Upadve fc SpergeTI (120061 ). i.e. inconsistent param- 
eter values are obtained from different datasets. Also the 
DGP model may b e in difficulties with the CMB peak an d 
baryon oscillations l|Rvdbeck. Fairbairn fc Goobar I (|2007h ). 
as well as theoretically through the existence of ghosts 



On model selection forecasting, Dark Energy and modified gravity 5 



jGorbunov. Kovama fc Sibirvakovl (|2006l )). Here we con- 
centrate instead on distinguishing modified gravity models 
from GR using a different feature - that the growth factor 
is different from that of a quintessence model with the same 
expansion history. We use the DGP model as a specific ex- 
ample of a more general test of modified gravity models. 

We make the (conservative) assumption that modifica- 
tion of gravity does not change the growth factor of pertur- 
bations on scales comparable to the horizon, i.e. in all cases 
the Integrated Sachs Wolfe (ISW) effect at large CMB an- 
gular scale is computed assuming the perturbation growth 
is given by standard gravity. In addition for the calculation 
of the CMB low I multipoles we assume the dark energy 
perturbations associated to a scalar field with the same ef- 
fective equation-of-state parameters as the modified gravity 
model. This is a conservative assumption as in a modified 
gravity model the ISW effect is expected to be different from 
standard gravity and therefore CMB observations may have 
some extra sensitivity to 7 which we ignore here. 

The full set of parameters we explore in M' is 
h, as, n 3 ,a n ,T,r, wo, w a , being the density parame- 
ters in matter and baryons, the Hubble constant (in units of 
100 km _1 Mpc _1 ), the amplitude of fractional density per- 
turbations, the primordial scalar spectral index of density 
fluctuations, and its running with wavenumber k, the reion- 
isation optical depth, the tensor-to-scalar ratio; finally, there 
are two parameters characterising t he expansion history of 
the U niverse, w(a) = wo + w a (l — a) |Chevallier fc Polarski I 
1120011 )). For Dark Energy models, this is the equation of 
state parameter p/(pc 2 ) as a function of scale factor a. How- 
ever, it is used here only as a means of parametrising of 
the expansion history, in terms of an effective Dark En- 
ergy component - w(a) is not necessarily associat e d wit h 
a Dark Energy compo nent . See iHuterer fc Linder I ^2006) ; 
iLinder fc Cahn I (|2007h and lKunz fc Sapone I For ex- 

ample, in the DGP model the expansion history is described 
well by wo = —0.78, w a = 0.32. The Fisher matrices are al- 
most unchanged if we take this as the fiducial model, so we 
present results for wo — — 1, w a = 0. 7 is an additional pa- 
rameter in M (set fixed at 0.55 in M'), which parametrises 
the growth of structure For completeness, we list here the 
other fiducial parameters: fi m = 0.27, fit = 0.04, h — 
0.71, a 8 = 0.8, n 3 = 1.0, a„ = 0.0, r = 0.09, r = 0.01. 

Thus the question we want to address is the following: 
assuming that the model of the Universe is a modified grav- 
ity model, is there an experimental setup which can distin- 
guish this model from a Dark Energy model with the same 
expansion history? In this application we initially take the 
parameters of the model to be the DGP ones. 

The experiments we consid er are the upcoming P lanck 
microwave background survey (|Lamarre et al. I l|2003l )). in- 
cluding polarisation information, three 3D weak lensing sur- 
veys and proposed SN and BAO surveys. Note that, as dis- 
cussed above, we set the CMB constraint on 7 to zero. The 
constraint on r and r from the weak lensing is similarly zero 
- these are assumed fixed in the weak-lensing alone experi- 
ments; in the weak-lensing plus CMB, the constraints on r 
and r come from the CMB. 

We consider a number of 3D weak lensing surveys: 
firstly a survey covering 5000 square degrees to a median 
redshift of z m — 0.8 with a source density of 10 galaxies per 
square arcminute, such as might be achieved with the Dark 



Survey 


Area/sq deg 




no/sq ' 


(T z (z) 




DES 


5,000 


0.80 


10 


0.05(1 + 


*) 


PS1 


30,000 


0.75 


5 


0.06(1 + 


*) 


WL NG 


20,000 


0.90 


35 


0.025(1 4 


-*) 



Table 1. The critical survey parameters for the Weak Lensing 
experiments considered. DES is the Dark Energy Survey, PS1 
is the Pan-STARRS single telescope 3tt survey, and W Lj^q is 
a hypothetical 'next-generation' imaging survey, such as DUNE, 
SNAP, or LSST. Zm is the median redshift, tiq the number of 
sources per square arcminute, and <r z is the assumed photometric 
redshift error. 



Energy Survey (DES) (|Wester et al. I (|2005h ): second a sur- 
vey co vering 30, 000 square degrees of the sky (|Kaiser et al. I 
(2002)) to a median depth of z m = 0.75 with 5 galaxies per 
square arcminute, as might be achieved with Pan-STARRS 
(we consider only the single-telescope Pan-STARRS 1); 
third is a survey of 35 sources per square arcminute, z m — 
0.90, and an area of 20, 000 square degrees (next generation 
weak lensing survey, WLjvg), as might be observed by a 
space-based survey such as DUNE, which is a candidate for 
the ESA Cosmic Vision programme, or the Supernova Ac- 
celeration Probe (SNAP), a candidate of the NASA Joint 
Dark Energy Mission. Note that the characteristics of the 
Large Synoptic Survey Telescope (LSST) data set are not 
too dissimilar from these, so the reported numbers would 
be very close to those for LSST. For all surveys we as- 
sume flatness and a redshift dependence of source density 
n(z) oc z 2 exp [— (2/z*) 1 ' 5 ] , with z* = lAz m and use the 
3D we ak s hear power spectrum analysis method of [Heavens] 
i|2003l ) and lHeavens. Kitching fc Taylor I (|2006l ). The modes 
are truncated at k — 1.5Mpc , avoiding the highly non- 
linear regime where uncertainties in t he power spectrum 
may l ead to dominant systematic errors l|Huterer fc Linden 
l|2006t ). Fig. 4). 

The survey parameters are summarised in Table 
[T] including the photometric redshift error a z and 
the number of sources per square arcminute, no. The 
Fisher matrices for the four experiments are available at 
http:/ /www.roe. ac.uk/~afhl As there is a degeneracy be- 
tween wo, w a and 7 for Planck+Wh, better constraints on 
the Universe expansion history lead to a better determina- 
tion of 7 and therefore better model selection power. For 
probes of the expansion history we consider supernovae and 
a sample of 2000 supe rnovae type la at < z ^ 1.8 (see 
IVirev et all il2004l) andlYeche et al. I (I2006TI) as prod uced by 
SNAP l|Aldering et al. I l|2004l ); I Albert et al. I l|2005l )) or the 
Advanced Dark Energy Probe Telescope (ADEPT, a can- 
didate for the NASA Joint Dark Energy Mission; C. Ben- 
nett, private communication). For BAO we consider a wide 
survey contemplated by WFMOS (|Bassett et al. 1 1)20051 )) or 
ADEPT. 

From these Fisher matrices (the Fisher matrix of a com- 
bination of independent data sets is the sum of the individ- 
ual Fisher matrices), we compute the ratio of expected evi- 
dences assuming that the true model is a DGP braneworld, 
and take a prior range A7 = 1. Table [2] shows the expected 
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evidence for the 3D weak lensing surveys with and without 
Planck. 

We find that \nB obtained for the standard General 
Relativity model is only ~ 1 for DES+Planck, whereas 
for Pan-STARRS+PZancfc we find that kiB ~ 2, for 
P&n-STARRS+Planck+SN+BAO hxB ~ 3.61 and for 
WLng+ Planck, lnB is a decisive 52.2. Furthermore a 
WLng experiment could still decisively distinguish Dark 
Energy from modified gravity without Planck. The ex- 
pected evidence in this case scales proportionally as the 
total number of galaxies in the survey. Pan-STARRS and 
Planck should be able to determine the expansion history, 
parametrised by w(a) to very high accuracy in the context 
of the standard General Relativity cosmological model, with 
an accuracy of 0.03 on w(z ~ 0.4), it will be able to substan- 
tially distinguish between General Relativity and the sim- 
plification of the DGP braneworld model considered here, 
although this does depend on there being a strong CMB 
prior. 

The relatively low evidence from DES+Planck in com- 
parison to Pan- STARRS + Planck is due to the degeneracy 
between the running of the spectral index a, and 7. The 
larger effect volume of Pan-STARRS, in comparison to DES, 
places a tighter constraint on this degeneracy. This could be 
improved by on-going high resolution CMB experiments. 

Alternatively, we can ask the question of how different 
the growth rate of a modified-gravity model would have to 
be for these experiments to be able to distinguish the model 
from General Relativity, assuming that the expansion his- 
tory in the modified gravity model is still well described 
by the wo,w a parametrization. This is shown in FigJ5] It 
shows how the expected evidence ratio changes with pro- 
gressively greater differences from the General Relativistic 
growth rate. We see that a WLjvg survey could even distin- 
guish 'strongly' <5 7 = 0.048, Pan-STARRS Sj = 0.137 and 
DES 5"/ — 0.179. Note that changing the prior range A7 by 
a factor 10 changes the strong/decisive boundary for #7 (for 
WLng +Planck) by ~ 0.012, so the dependence on the prior 
range is rather small. 

If one prefers to ask a frequentist question, then a com- 
bination of WLjvG+P'ancfc+BAO+SN should be able to dis- 
tinguish 57 = 0.13, at 10.6cr. Results for other experiments 
are shown in Fig. [5] 

The case for a large, space-based 3D weak lensing survey 
is strengthened, as it offers the possibility of conclusively dis- 
tinguishing Dark Energy from at least some modified gravity 
models. 



4 CONCLUSIONS 

We have shown in this paper how one can compute the 
Bayesian Evidence under the assumption of a Gaussian like- 
lihood surface, taking account of the fact that assuming the 
wrong model choice can affect the best-fitting values of pa- 
rameters in the models. The assumption of a Gaussian like- 
lihood is the same as used in the Fisher matrix approach to 
forecasting parameter errors, and we find that the Evidence 
can be calculated directly from the Fisher matrix alone, and 
with very little extra computation. An important caveat is 
that the assumption that the likelihood is a multivariate 
Gaussian is very strong, and deviations from this could eas- 
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Figure 2. The expected value of | lnB| from WLjvg (solid), Pan- 
STARRS (dot-dashed) and DES (dashed), in combination with 
CMB constraints from Planck, as a function of the difference in 
the growth rate between the modified-gravity model and General 
Relativity. The crossover at small 67 occurs because Occam's ra- 
zor will favour the simpler (General Relativity) model unless the 
data demand otherwise. To the left of the cusps, GR would be 
likely to be preferred by the data. The dotted vertical line shows 
the offset of the growth factor for the DGP model. The horizon- 
tal lines mark the boundaries between 'inconclusive', 'significant', 
'strong', and 'decisive' in Jeffreys' (f961) terminology. 



Survey 


V 


|ln£| 




DES+Pkncfc+BAO+SN 


3.5 


1.28 


substantial 


DES+PZancfc 


2.2 


0.56 


inconclusive 


DES 


0.7 


0.54 


inconclusive 


PSl+PZancAc+BAO+SN 


2.9 


3.78 


strong 


PSl+Planck 


2.6 


2.04 


substantial 


PS1 


1.0 


0.62 


inconclusive 


WL NG +Planck+BAO+SN 


10.6 


63.0 


decisive 


WLNG+Plo-nck 


10.2 


52.2 


decisive 


WL NG 


5.4 


11.8 


decisive 



Table 2. The evidence ratio for the three weak lensing experi- 
ments considered with and without Planck, supernova and BAO 
priors. WLpfG is a next-generation space-based imaging survey 
such as proposed for DUNE or SNAP. For completeness, we also 
list the frequentist significance va with which GR would be ex- 
pected to be ruled out, if the DGP braneworld were the correct 
model. 



ily change the evidence substantially. Nevertheless, the as- 
sumption is the same as is used in estimating marginal errors 
with Fisher matrices; for the CMB part used here we find 
agreement with 30% accuracy (note our Fisher matrices for 
Planck are conservative). However, this method should only 
be regarded as a first step, to identify which experimental se- 
tups are worth exploring with more detailed investigations. 
For the 3D weak lensing part of the study in particular, this 
method is extremely valuable, as simulating the surveys and 
subsequent analysis is an enormously time-consuming task. 
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We have also shown how future observations of the Cos- 
mic Microwave Background, 3D weak lensing and probes 
of the expansion history of the Universe offer considerable 
promise of distinguishing decisively between Dark Energy 
models and modifications to General Relativity. In partic- 
ular, a combination of Planck (CMB) and proposed space- 
based wide field imaging (weak lensing) surveys should be 
able decisively to distinguish a Dark Energy General Rela- 
tivity model from a DGP modified-gravity model with nat- 
ural log of the expected evidence ratio ln_B ~ 50. 

Surveys such as DUNE/SNAP/LSST, Pan-STARRS 
and DES, in combination with Planck, should be able 
to distinguish 'strongly' between General Relativity and 
minimally-modified gravity models with growth rates larger 
than 0.60, 0.69 and 0.73 respectively. The addition of probes 
of the expansion history of the Universe, such as supernova 
and baryon acoustic oscillation surveys help lift residual de- 
generacies and thus to distinguish 'strongly' models with 
growth rates larger than 0.59, 0.67 and 0.71 respectively. 
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